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TECHNICAL FIELD 

This invention relates to the field of digital computer systems, and more 
particularly, to the addressing of PCI devices in a general-purpose multi-tasking 
operating system having a CPU which provides a number of address lines less 
than that required by a PCI device bus. 

BACKGROUND 

Digital computer operating systems periodically schedule processes for the 
central processing unit (CPU) of the computer to perform. The operating system 
schedules sub-parts of executable applications, known as threads, on the CPU. In 
multi-tasking operating systems, threads can be independently scheduled for 
execution on the processor so that more than one computer program or application 
may proceed at the same time. While the CPU only works on one thread at a time, 
it appears to the user that the CPU is running multiple applications concurrently. 
During multi-tasking, the CPU interrupts execution of one thread to begin 
execution of another thread or to resume execution of a previously interrupted 
thread. Switching from one thread to another is known as a "context switch." 

One multi-tasking operating system currently available is the "WINDOWS 
CE" operating system developed by Microsoft Corporation. "WINDOWS CE" 
supports a wide variety of CPUs. Many of these CPUs do not provide full 32-bit 
addressing on the external pins of the device. This presents a problem in that 
many of today's computer systems utilize fast I/O devices with high-speed 32-bit 
buses, such as the industry standard peripheral component interconnect (PCI) bus. 
PCI devices, such as video controllers, are assigned a range of contiguous 
addresses on the bus through which they may be accessed. 
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To enable access to several PCI devices, each PCI device is assigned a 
unique, non-overlapping address range, whereby the high order bits of the 32-bit 
address are used to distinguish among address spaces for different devices. For 
instance, the upper two bits of a 28-bit address might be used to differentiate 
between four different PCI devices, with the remaining 26 bits being used for 
addressing within the devices themselves. Unfortunately, the CPU itself may not 
support 28 bits, but only 26 bits. Accordingly, there is no way for the CPU to 
distinguish amongst the different PCI devices on the bus. When a CPU is 
incapable of providing the full 32-bit PCI address space, there is a need to extend 
the addressing output of the CPU to properly access a PCI device connected to a 



Special chips, referred to as "PCI host bridges," are available to interface a 
CPU to a PCI bus. Some of these chips have circuitry for extending the limited 
address range of CPUs such as those discussed above. 

Fig. 1 shows a system utilizing a PCI host bridge. The system includes a 
prior art computer 10 having a microprocessor 12 coupled to a PCI host bridge 14. 
The microprocessor 12 has a central processing unit (CPU) 16. The CPU 14 is 
connected to multiple devices, such as a PCMCIA driver 18, a universal serial bus 
(USB) 20, an I/O (input/output) port 22, and a bus interface unit (BIU) 24, via an 
address bus 26. The BUI 24 interfaces the microprocessor 12 to the PCI host 
bridge 14 via a standard address bus 30 having less than thirty-two lines. The PCI 
host bridge 14 converts addresses received from the address bus 30 to full 32-bit 
addresses output over a 32-bit PCI bus 32. One or more PCI devices are coupled 
to the PCI bus 32, as represented by devices 34a, 34b, and 34c. 



PCI bus. 
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The PCI host bridge 14 has an extension register 36 to extend the address 
on the bus 32 (i.e., less than 32 bits) to a full 32-bit address suitable for the PCI 
bus 32. More particularly, the extension register 36 contains memory to store a 
number of bits which are concatenated with lower-order bits from the standard 
address bus 30 to produce a full 32-bit address for one of the PCI devices 34. 

The extension register 36 is read from and written to by the CPU 16 to 
allow the CPU to set the value of the high order address bits. By changing the 
values in the extension register, the CPU 16 is able to address the full address 
space supported by the PCI bus 32. For example, assume that a CPU 16 outputs a 
26-bit address on bus 30. Further assume that one of the PCI devices 34 is a video 
device that has been assigned an address range of A4000000 through A7FFFFFF 
(expressed in hexadecimal notation). To address the video device, the CPU 16 
writes the binary value 101001 to the extension register 36 to set the upper six bits 
of the PCI address. The external address pins of the CPU 16 then supply the low 
order twenty-six bits during normal bus cycle generation. If the CPU is then 
required to address a different PCI device, the CPU 16 changes the value in the 
extension register 36 to match the high order address bits of the target PCI device. 
For example, if the second PCI device is mapped to the address range of 
A8000000 through ABFFFFFF, the CPU 16 writes binary value 101010 to the 
extension register 36 and supplies the lower order twenty-six bits of the address 
during the bus cycle generation. 

The procedure operates adequately within a non-multi-tasking environment, 
but not in a multi-tasking operating system environment. In present multi-tasking 
systems, the operating systems are designed to work with specific CPUs that only 
supply an adequate number of address lines, thereby avoiding the problem 
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described above. The operating system may update the extension register once at 
system initialization and never update it again since the address bits supplied by 
the extension register are common across all PCI devices. This situation does not 
pose any problems because the correct address is generated while the PCI device 
is accessed. 

However, consider the situation of a multi-tasking operating system 
working with CPUs that do not provide a sufficient number of address lines. The 
process described must be maintained even though the operating system may 
suspend or resume execution of a particular thread at any given time. This poses 
some potentially troublesome problems in that the extension register may contain 
the wrong values as a result of the context switching performed by the multi- 
tasking operating system. 

Suppose, for example, two threads A and B are accessing PCI devices 1 
and 2, respectively, via the PCI host bridge. Thread A needs the extension register 
set to a value "X" to access PCI Device 1 and thread B requires the extension 
register to have a value "Y" to access PCI Device 2. Table 1 describes the 
situation. 
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Table 1: Inter-process Conflict In A Multi-tasking 0 


perating System 


Time 


Thread A 


Thread B 


Extension 
Register Value 


TO 


Extension Register = X 




X 


Tl 


Access PCI Device 1 




X 


T2 


Context Switch 


X 


T3 




Extension Register = Y 


Y 


T4 




Access PCI Device 2 


Y 


T5 


Context Switch 


Y 


T6 


Access PCI Device 1 


Y 



At time TO, thread A requests the CPU to set the extension register to value 
X. The CPU then accesses PCI device 1 at time Tl. At time T2, the operating 
system performs a context switch and schedules thread B for execution. At time 
T3, thread B requests the CPU to set the extension register to value Y. The CPU 
then accesses PCI device 2 at time T4. 

Thereafter, the operating system performs another context switch at time T5 
to return to thread A for more execution. However, note that the extension register 
value remains at the value Y (the value for thread B) rather than the value X for 
thread A. The value in the extension register is not updated at this time because 
such an update only occurs when a thread - in this case, thread A - is initially 
called. As a result, when thread A attempts to access PCI device 1 again at time 
T6, the extension register contains value Y (i.e., the extension value for PCI 
device 2) and the access attempt fails. 

One way to resolve this problem is to disable the context switching in the 
operating system while a PCI device is being accessed. This, however, could 
compromise the performance of the operating system since it may result in "jerky" 
operation of the applications being performed and would adversely affect the 
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illusion of multi-tasking to the user. Additionally, in time-critical, real-time 
operating systems that are used in critical medical equipment or aircraft flight 
control systems, it is absolutely necessary to guarantee that context switches will 
occur within a specific time period. If context switching is disabled for an 
arbitrary period of time while a PCI device is being accessed in such a system, the 
time-critical thread may not execute within its allotted time period, thereby 
causing failure of the critical system. 

A better solution to the problem is to provide a way for the operating 
system to restore the value of the extension register prior to each context switch so 
that the correct PCI device is accessed by the operating system every time. This 
solution is accomplished by the present invention. 

SUMMARY 

The present invention concerns a multi-tasking operating system and 
method that updates extension register values to ensure that various threads utilize 
the correct values to access corresponding PCI devices. In this manner, when 
execution of an interrupted thread is resumed, the thread will be able to access the 
correct PCI device. 

When a first thread of an application program requires access to a PCI 
device, the operating system writes the high order bits of the PCI device address to 
two places: (1) the extension register of the PCI host bridge and (2) a separate 
memory location associated with the first thread. Similarly, when a second thread 
is started (i.e., the first thread is suspended) and desires access to a second PCI 
device, the operating system writes the high order bits of the second PCI device 
address to two places: (1) the extension register of the PCI host bridge and (2) a 
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second memory location associated with the second thread. In the described 
implementation, the separate memory locations are implemented as a table that 
holds the upper address bits of a PCI device addresses in correlation with the 
threads that call the devices. 

When a subsequent context switch occurs from the second thread back to 
the first thread (i.e., the second thread is suspended and execution of the first 
thread is resumed), the operating system retrieves the stored value from the 
memory location associated with the first thread and writes the value to the 
extension register. Therefore, when the first thread tries to access its particular 
PCI device, the proper value is already located in the extension register and can be 
concatenated with the CPU output address to access the PCI device. 

When the next context switch to the second thread occurs, the operating 
system retrieves the stored value from the memory location associated with the 
second thread and writes the value to the extension register. Therefore, the proper 
value is located in the extension register so that the correct PCI device is accessed 
by the second thread. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1 is a block diagram of a prior art computer having a microprocessor 
interfacing with a PCI host bridge to provide a PCI address to multiple PCI 
devices via a PCI bus. 

Fig. 2 is a block diagram of a computer having a multi-tasking operating 
system with a thread scheduler for scheduling threads in conjunction with a PCI 
host bridge to enable access to PCI devices on a PCI bus. 



Ue A Hayes, PLLC 



7 



0602991019 MS1-317US.PA TAPP.DOC 



I 

2 
3 
4 
5 
6 
7 
8 
9 
10 
11 
12 
13 
14 
15 
16 
17 
18 
19 
20 
21 
22 
23 
24 
25 



Fig. 3 is a block diagram of the multi-tasking operating system, which 
shows an extension register initialization function and an extension register value 
table used in scheduling threads. 

Figs. 4-6 is a flow diagram of a process for managing the extension register 
to update values therein with each context switch among multiple threads. Fig. 4 
shows steps implemented by an application program (or its corresponding thread 
and driver). Fig. 5 shows steps performed by a multi-tasking operating system. 
Fig. 6 shows steps performed by the extension register initialization function. 

DETAILED DESCRIPTION 

The present invention concerns a multi-tasking operating system and 
methods that enable extension of reduced-size addresses output by a CPU to full- 
size addresses carried over a PCI bus for accessing various PCI devices. The 
multi-tasking operating system may be implemented in a number of computing 
devices. For discussion purposes, the operating system is described in the context 
of a handheld computer. 

Fig 2 is a block diagram showing pertinent functional components of a 
computer 40 in which the present invention may be implemented. The computer 
40 is similarly constructed as computer 10 in Fig. 1 with respect to microprocessor 
12, PCI host bridge 14, and PCI devices 34a-34c. The microprocessor 12 outputs 
reduced-sized addresses (e.g., less than 32 bits) on bus 30, and the PCI host bus 14 
extends the addresses to full-size 32-bit addresses for the PCI bus 32. 

The computer 40 has a nonvolatile memory 50 that stores a multi-tasking 
operating system 52 with a thread scheduler 54. The multi-tasking operating 
system 52 supports concurrent operation of multiple application programs, as 
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represented by four applications 56a, 56b, 56c, and 56d (although more or less 
applications may be used). The microprocessor 12 communicates with the 
memory 50 via a standard data bus 58, which is well known in the art. 

The multi-tasking operating system 52 assigns threads for associated 
applications 56a, 56b, 56c, and 56d. As noted in the Background, one problem 
encountered in the context of multi-tasking operating systems executing on CPUs 
with less than the number of address pins for full bus addressing occurs during 
context switching among the multiple threads. A context switch occurs whenever 
the multi-tasking operating system interrupts one thread to execute another thread. 
The problem is that, following a context switch, the extension register 36 may not 
contain the appropriate value for the current thread to access a PCI device on the 
PCI bus, but instead might contain a value used by a previously executed thread. 

To overcome this problem, the multi-tasking operating system 52 stores the 
addressing values used in the extension register 36 for each thread and updates the 
extension register 36 each time the multi-tasking operating system performs a 
context switch between two threads. More particularly, when thread A of 
application program 56a requires access to a PCI device 34a, the operating system 
writes the high order bits of the PCI device address to two places: (1) the 
extension register 36 of the PCI host bridge 14 and (2) a memory location 
associated with thread A. Similarly, when thread B is subsequently started (i.e., 
thread A is suspended) and desires access to a second PCI device, the operating 
system writes the high order bits of the second PCI device address to two places: 
(1) the extension register 36 of the PCI host bridge 14 and (2) a second memory 
location associated with the second thread. 
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When a context switch occurs from thread B back to thread A, the 
operating system retrieves the stored value from the memory location associated 
with thread A and writes the value to the extension register 36. Therefore, when 
thread A tries to access its particular PCI device, the proper address value is 
already located in the extension register and can be concatenated with the CPU 
output address to access the PCI device. 

Fig. 3 shows the memory 50 and components of the multi-tasking operating 
system 52 in more detail. In addition to thread scheduler 54, the operating system 
52 includes an extension register value table 60 and an extension register 
initialization function 62. The extension register value table 60 stores unique 
extension values used to address the PCI devices in association with the 
application programs that are accessing the PCI devices. In the illustrated 
example, the table 60 stores extension registration values 64a-64d for the four 
threads A, B, C, and D associated with application programs 56a-56d. The 
operating system also includes one or more device drivers 66. 

Figs. 4-6 show a general process for managing the extension register to 
update values therein with each context switch among multiple threads. Figs. 4 
and 5 illustrate an initializing process when an application thread decides to write 
to a peripheral device for the first time. Fig. 6 illustrates a process in which the 
operating system switches contexts among existing threads. The processes are 
described with additional reference to Figs. 2 and 3. The processes are performed 
in software by code embodied in various software components stored in memory, 
including an application program 56 (or corresponding OS-based thread) (Fig. 4), 
the extension register initialization function 62 (Fig. 5), and the operating system 
52 (Fig. 6). _ 
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With reference to Fig. 4, an application program 56a prepares to write data 
to a peripheral device 34a via the PCI host bridge 14 and PCI bus 32. The 
application program 56a is executed by a corresponding thread A, which in turn 
executes under the control and supervision of operating system 52. Prior to 
utilizing peripheral device 34a, the thread A calls the extension register 
initialization function 62 and passes in a base address for use by the extension 
register for properly addressing the peripheral device 34a (step 100 in Fig. 4). The 
base address might be a full address from which an address extension is 
calculated, or only the upper bits that form the desired address extension. The 
initialization function then performs several steps, as discussed below with respect 
to Fig. 5. Upon completion by the initialization function 62, thread A writes data 
to or reads data from the peripheral device 34a (step 102 in Fig. 6). 

The actual access of PCI devices is often accomplished through the use of 
operating system device drivers such as drivers 66 shown in Fig. 3. In these 
situations, the call to register initialization function 62, and the subsequent access 
to the PCI device itself, is made by the device driver. Each device driver is 
designed to call register initialization function 62 with a desired base address or 
address extension prior to accessing a PCI device. 

With reference to Fig. 5, when the extension register initialization function 
62 is called (step 100 in Fig. 4), the function 62 receives a base address or address 
extension value for accessing the PCI peripheral device 34a from the thread A 
(step 110 in Fig. 5). The initialization function 62 then writes the appropriate 
extension register value to the extension register 36 in PCI host bridge 14 (step 
112 in Fig. 5). In the case where register initialization function 62 receives a base 
address rather than the extension value itself, function 62 calculates the 
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appropriate extension value — in most cases by determining the value of the 
uppermost bits of the base address. 

Contemporaneously with this write operation (step 112), the extension 
register initialization function 62 writes the received extension register value or 
base address to a memory location and associates the value with the thread A (step 
114 in Fig. 5). In the described implementation, the value is stored in the 
extension register value table 60 as entry 64a. Although a table is illustrated, it 
should be understood that other techniques may be used to store the extension 
value in association with the calling thread. 

After the extension register initialization function 62 finishes, control is 
then returned to the application program thread A to write data to or read data 
from the peripheral device 34a. 

Now, suppose a second application 56b is launched. The thread scheduler 
54 of multi-tasking operating system 52 starts another thread B to execute the 
newly launched application program 56b. In addition, suppose the second 
application program 56b wants to write data to a second peripheral device 34b 
over the PCI bus 32. For this to occur, the extension register 36 in PCI host bridge 
14 must be updated if and when the thread A currently executing is interrupted by 
the thread scheduler 54 to execute the second thread B of the second application 
program 56b. 

Prior to any attempt to access the second peripheral device 34b, the thread 
B calls the extension register initialization function 62 and passes in a base address 
or extension value for use in properly addressing the peripheral device 34b (i.e., 
step 100 in Fig. 4). The extension register initialization function 62 writes the 
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received or calculated value to the extension register 36 and to the extension 
register value table 60 as entry 64b (i.e., steps 112 and 1 14 in Fig. 5). 

With multiple threads executing, the multi-tasking operating system 52 
performs context switches between the threads A and B. The operating system 52 
executes thread B of the application program 56b until it is completed or until it is 
interrupted by the thread scheduler 54 to perform a context switch to thread A of 
the first application program 56a; 

Fig. 6 shows a process for updating the extension register in response to 
context switching performed by the operating system. For discussion purposes, 
the following example assumes that thread B is currently executing and that the 
thread next scheduled for execution is thread A of the first application program 
56a. 

When the multi-tasking operating system 52 performs a context switch 
from thread B back to thread A, the operating system 52 retrieves the extension 
register value for thread A from entry 64a of extension register table 60 (step 120 
in Fig. 6). The operating system then writes the extension register value to the 
address extension register 36 so that the thread A will be able to access the first 
PCI device 34a (step 122 in Fig. 6). The multi-tasking operating system 52 then 
resumes execution of the thread of the application program 50a (step 124 in Fig. 
6). 

This context switching process of Fig. 6 may be repeated as desired for 
however many threads are being executed. 

According to another aspect of this invention, a default extension value is 
utilized for any threads that have not called register initialization function 62. 
When a thread is initiated, its corresponding entry in table 60 is initialized with 
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this default value. This default value can be overridden by any particular thread, 
by calling register initialization function 62 prior to accessing a PCI device. 

Although the invention has been described in language specific to structural 
features and/or methodological steps, it is to be understood that the invention 
defined in the appended claims is not necessarily limited to the specific features or 
steps described. Rather, the specific features and steps are disclosed as preferred 
forms of implementing the claimed invention. 
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